AITopics | inverse hessian




e6385d39ec9394f2f3a354d9d2b88eec-Supplemental.pdf

Neural Information Processing Systems | Feb-10-2026, 21:36:20 GMT

checkpoint, computation, test example, (15 more...)

Neural Information Processing Systems

Country:

South America > Colombia > Bogotá D.C. > Bogotá (0.14)
North America > United States > California > Santa Clara County > Palo Alto (0.04)
Europe > Switzerland (0.04)
(2 more...)

Industry: Government (0.69)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.56)


7cfd5df443b4eb0d69886a583b33de4c-Paper.pdf

Neural Information Processing Systems | Feb-9-2026, 12:35:38 GMT

algorithm, approximation, gradient, (17 more...)

Neural Information Processing Systems

Country:

North America > United States > California > San Francisco County > San Francisco (0.14)
Europe > Austria (0.04)
North America > United States > Wisconsin > Dane County > Madison (0.04)

Genre: Research Report > New Finding (0.67)

Technology:

Information Technology > Artificial Intelligence > Natural Language (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.89)


Batch Acquisition Function Evaluations and Decouple Optimizer Updates for Faster Bayesian Optimization

Irie, Kaichi, Watanabe, Shuhei, Onishi, Masaki

arXiv.org Artificial Intelligence | Dec-10-2025

Bayesian optimization (BO) efficiently finds high-performing parameters by maximizing an acquisition function, which models the promise of parameters. A major computational bottleneck arises in acquisition function optimization, where multi-start optimization (MSO) with quasi-Newton (QN) methods is required due to the non-convexity of the acquisition function. BoTorch, a widely used BO library, currently optimizes the summed acquisition function over multiple points, which speeds up MSO through PyTorch batching. Nevertheless, this paper empirically demonstrates that this approach is suboptimal: it introduces off-diagonal approximation errors in the inverse Hessian of the QN method, which slow its convergence. To address this problem, we propose to decouple QN updates using a coroutine while batching the acquisition function calls. Our approach not only yields convergence theoretically identical to that of sequential MSO but also drastically reduces the wall-clock time compared to previous approaches. Our approach is available in GPSampler in Optuna, effectively reducing its computational overhead.
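The decoupling idea in this abstract can be illustrated with plain Python generators. The following is a minimal sketch under stated assumptions, not the Optuna or BoTorch implementation: each start point runs its own small BFGS loop as a generator that yields a query point and receives the value and gradient back, while a driver gathers the queries of all live runs into one batched call. The names `bfgs_coroutine`, `batched_multistart`, and `toy_objective` are hypothetical, and the toy objective stands in for a negated acquisition function (the sketch minimizes).

```python
import numpy as np


def bfgs_coroutine(x0, tol=1e-8, max_iter=100):
    """One BFGS run as a generator: yields a query point, receives (f, grad)."""
    x = np.asarray(x0, dtype=float)
    H = np.eye(x.size)  # this run's private inverse-Hessian estimate
    f, g = yield x
    for _ in range(max_iter):
        if np.linalg.norm(g) < tol:
            break
        p = -H @ g  # quasi-Newton search direction
        step = 1.0
        while True:  # Armijo backtracking line search
            x_new = x + step * p
            f_new, g_new = yield x_new
            if f_new <= f + 1e-4 * step * (g @ p):
                break
            step *= 0.5
            if step < 1e-10:  # line search failed: keep the current point
                return x, f
        s, y = x_new - x, g_new - g
        sy = s @ y
        if sy > 1e-12:  # curvature condition keeps H positive definite
            rho = 1.0 / sy
            eye = np.eye(x.size)
            H = (eye - rho * np.outer(s, y)) @ H @ (eye - rho * np.outer(y, s))
            H += rho * np.outer(s, s)
        x, f, g = x_new, f_new, g_new
    return x, f


def batched_multistart(batched_fun, starts):
    """Drive all runs together; every round costs one batched evaluation."""
    runs = [bfgs_coroutine(x0) for x0 in starts]
    pending = {i: next(run) for i, run in enumerate(runs)}  # first queries
    results = {}
    while pending:
        idx = list(pending)
        F, G = batched_fun(np.stack([pending[i] for i in idx]))
        for i, f, g in zip(idx, F, G):
            try:
                pending[i] = runs[i].send((f, g))  # resume only run i
            except StopIteration as stop:
                results[i] = stop.value  # run i finished
                del pending[i]
    return [results[i] for i in range(len(starts))]


def toy_objective(X):
    """Batched non-convex toy function with minima where every |x_j| = 1."""
    F = ((X**2 - 1.0) ** 2).sum(axis=1)
    G = 4.0 * X * (X**2 - 1.0)
    return F, G


starts = [np.array([2.0, 2.0]), np.array([-0.5, 1.5]), np.array([0.3, -2.0])]
for x_best, f_best in batched_multistart(toy_objective, starts):
    print(x_best, f_best)
```

Because each generator keeps its own inverse-Hessian estimate `H`, no run sees curvature information from another start, so each run follows the same iterates it would follow if optimized alone. Only the function evaluations are shared.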

artificial intelligence, machine learning, optimization, (16 more...)

arXiv.org Artificial Intelligence

2511.13625

Genre: Research Report (0.82)

Technology:

Information Technology > Artificial Intelligence > Machine Learning (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Optimization (0.89)


Distributed estimation of the inverse Hessian by determinantal averaging

Neural Information Processing Systems | Aug-19-2025, 23:42:22 GMT

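An example of this occurs in distributed Newton's method, where we wish to compute (or implicitly work with) the inverse Hessian multiplied by the gradient.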

determinantal averaging, matrix, optimization, (12 more...)

Neural Information Processing Systems

Country:

North America > United States > California > Alameda County > Berkeley (0.04)
Oceania > Australia > New South Wales > Sydney (0.04)
North America > United States > California > Los Angeles County > Long Beach (0.04)
(3 more...)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning (0.68)


f7ac67a9aa8d255282de7d11391e1b69-Supplemental.pdf

Neural Information Processing Systems | Aug-17-2025, 08:34:43 GMT

approximation, implementation, influence function, (15 more...)

Neural Information Processing Systems

Technology: Information Technology > Artificial Intelligence > Machine Learning (1.00)


e6385d39ec9394f2f3a354d9d2b88eec-Supplemental.pdf

Neural Information Processing Systems | Aug-17-2025, 01:28:30 GMT

artificial intelligence, checkpoint, machine learning, (17 more...)

Neural Information Processing Systems

Country:

South America > Colombia > Bogotá D.C. > Bogotá (0.14)
North America > United States > California > Santa Clara County > Palo Alto (0.04)
Europe > Switzerland (0.04)
(2 more...)

Industry: Government (0.69)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.97)


M-FAC: Efficient Matrix-Free Approximations of Second-Order Information

Neural Information Processing Systems | Aug-15-2025, 10:07:27 GMT

Efficiently approximating local curvature information of the loss function is a key tool for optimization and compression of deep neural networks.

algorithm, approximation, gradient, (17 more...)

Neural Information Processing Systems

Country:

North America > United States > California > San Francisco County > San Francisco (0.14)
Europe > Austria (0.04)
North America > United States > Wisconsin > Dane County > Madison (0.04)

Genre: Research Report > New Finding (0.67)

Technology:

Information Technology > Artificial Intelligence > Natural Language (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.89)


Reviews: Distributed estimation of the inverse Hessian by determinantal averaging

Neural Information Processing Systems | Jan-26-2025, 13:11:22 GMT

Additional Comments:

• Overall, the article is well-written and structured. It has a clear contribution and significant theoretical justification. There are only a few typos and grammatical errors: Line 17: "tranformation" → "transformation"; Line 133: "its entries is of …" → "its entries are of …"; Line 146: "accross" → "across"; Line 150: "emprical" → "empirical".

• In line 54, it is better to define in the theorem 2 along with . If there is no constraint on the number of estimators, this means that the subsamples are chosen with replacement. However, in line 158, the authors claim that their method is without replacement.

determinantal, estimation, inverse hessian, (7 more...)

Neural Information Processing Systems

Technology: Information Technology > Artificial Intelligence (0.73)


Stochastic Quasi-Newton Optimization in Large Dimensions Including Deep Network Training

Suman, Uttam, Mamajiwala, Mariya, Saxena, Mukul, Tyagi, Ankit, Roy, Debasish

arXiv.org Artificial Intelligence | Oct-18-2024

We propose a new stochastic optimizer for non-convex and possibly non-smooth objective functions typically defined over large-dimensional design spaces. To this end, we try to bridge noise-assisted global search and faster local convergence, the latter being the characteristic feature of a Newton-like search. Our scheme, FINDER (Filtering Informed Newton-like and Derivative-free Evolutionary Recursion), exploits the nonlinear stochastic filtering equations to arrive at a derivative-free update that resembles the Newton search employing the inverse Hessian of the objective function. Following certain simplifications of the update to enable linear scaling with dimension, and a few other enhancements, we apply FINDER to a range of problems, from some IEEE benchmark objective functions, to a couple of archetypal data-driven problems in deep networks, to certain cases of physics-informed deep networks. The performance of the new method vis-à-vis the well-known Adam and a few others bears evidence to its promise and potential for large-dimensional optimization problems of practical interest.

artificial intelligence, evolutionary algorithm, machine learning, (19 more...)

arXiv.org Artificial Intelligence

2410.1427

Country:

North America > United States (0.28)
Asia > India (0.28)

Genre: Research Report (0.40)

Industry: Energy > Oil & Gas > Upstream (0.46)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Optimization (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Evolutionary Systems (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.93)
Information Technology > Mathematics of Computing (0.93)


Distributed estimation of the inverse Hessian by determinantal averaging

Neural Information Processing Systems | Oct-10-2024, 18:42:18 GMT

In distributed optimization and distributed numerical linear algebra, we often encounter an inversion bias: if we want to compute a quantity that depends on the inverse of a sum of distributed matrices, then the sum of the inverses does not equal the inverse of the sum. An example of this occurs in distributed Newton's method, where we wish to compute (or implicitly work with) the inverse Hessian multiplied by the gradient. In this case, locally computed estimates are biased, and so taking a uniform average will not recover the correct solution. To address this, we propose determinantal averaging, a new approach for correcting the inversion bias. This approach involves reweighting the local estimates of the Newton step in proportion to the determinant of the local Hessian estimate, and then averaging them together to obtain an improved global estimate.
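The reweighting described in this abstract can be checked on a toy problem. The sketch below is an illustration under assumptions of my own, not the authors' code: the Hessian is taken to be a ridge term plus a sum of rank-one matrices, each local estimate keeps every rank-one term independently with probability 0.5 and rescales it, and all possible subsamples are enumerated so that the averages equal exact expectations. The function names and the setup (`all_subsampled_hessians`, `ridge`, `keep_prob`) are hypothetical.

```python
import itertools

import numpy as np


def uniform_average(local_hessians, gradient):
    """Plain average of local Newton steps H_t^{-1} g (biased in general)."""
    steps = [np.linalg.solve(H, gradient) for H in local_hessians]
    return np.mean(steps, axis=0)


def determinantal_average(local_hessians, gradient):
    """Average of local Newton steps, each weighted by det(H_t)."""
    weights = np.array([np.linalg.det(H) for H in local_hessians])
    steps = np.array([np.linalg.solve(H, gradient) for H in local_hessians])
    return weights @ steps / weights.sum()


def all_subsampled_hessians(Z, ridge, keep_prob=0.5):
    """Every subsample of the terms z_i z_i^T, rescaled by 1 / keep_prob.

    With keep_prob = 0.5 all subsets are equally likely, so an unweighted
    sum over this list is proportional to an exact expectation.
    """
    n, d = Z.shape
    hessians = []
    for mask in itertools.product([0.0, 1.0], repeat=n):
        scaled = (np.array(mask) / keep_prob)[:, None] * Z
        hessians.append(ridge * np.eye(d) + scaled.T @ Z)
    return hessians


rng = np.random.default_rng(0)
Z = rng.standard_normal((6, 2))  # six rank-one terms in two dimensions
g = rng.standard_normal(2)       # gradient
ridge = 0.1

H = ridge * np.eye(2) + Z.T @ Z  # full Hessian
exact_step = np.linalg.solve(H, g)
local = all_subsampled_hessians(Z, ridge)

print("uniform error:      ", np.linalg.norm(uniform_average(local, g) - exact_step))
print("determinantal error:", np.linalg.norm(determinantal_average(local, g) - exact_step))
```

In this setting the determinant-weighted average matches the exact Newton step up to floating-point error, because the determinant and the adjugate of the subsampled matrix are each affine in every independent sampling variable. The uniform average stays visibly off, which is the inversion bias the abstract refers to.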

determinantal, estimation, inverse hessian, (4 more...)

Neural Information Processing Systems

Technology:

Information Technology > Mathematics of Computing (0.65)
Information Technology > Artificial Intelligence (0.41)
